Using Conjunctions and Adverbs for Author Verification

نویسندگان

  • Daniel Pavelec
  • Luiz Eduardo Soares de Oliveira
  • Edson José Rodrigues Justino
  • Leonardo Vidal Batista
چکیده

Linguistics and stylistics have been investigated for author identification for quite a while, but recently, we have testified a impressive growth in the volume with which lawyers and courts have called upon the expertise of linguists in cases of disputed authorship. This motivates computer science researchers to look to the problem of author identification from a different perspective. In this work, we propose a stylometric feature set based on conjunctions and adverbs of the Portuguese language to address the problem of author identification. Two different approaches of classification were considered. The first one is called writer-independent and it reduces the pattern recognition problem to a single model and two classes, hence, makes it possible to build robust system even when few genuine samples per writer are available. The second one is called the personal model, or writer-dependent, which very often performs better but needs a bigger number of samples per writer. Experiments on a database composed of short articles from 30 different authors and Support Vector Machine (SVM) as classifier demonstrate that the proposed strategy can produced results comparable to the literature.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Linguistic Study on the Translation of Parvin E’tesami’s Poems into English Using Catford’s Category Shifts

The present study aimed to investigate the translation into English by Alaeddin Pazargadi of Parvin E’tesami’s poems; in particular, it attempted to analyze the structural elements such as verbs, nouns, pronouns, adjectives, adverbs, articles, conjunctions, prepositions, and interjections in them. Considering the relationship between Linguistics and Translation Studies, the theoretical framewor...

متن کامل

An integrated model for the treatment of time in machine translation systems

One of the ways to achieve a good translation of verbal forms is the morphosp~tactic approach, which consists in a function pairing the different morphological tenses that occur in a given language with the tenses of the other language. Complicated rules must be established to calculate the right pair for an expression, because of the amount of discrepancies that differmnt languages show with r...

متن کامل

Automatic Verification of Iterated Separating Conjunctions Using Symbolic Execution

In permission logics such as separation logic, the iterated separating conjunction is a quantifier denoting access permission to an unbounded set of heap locations. In contrast to recursive predicates, iterated separating conjunctions do not prescribe a structure on the locations they range over, and so do not restrict how to traverse and modify these locations. This flexibility is important fo...

متن کامل

Investigating the Use of Paratactic and Hypotactic Conjunctions among Iranian Pre-university Students

In an attempt to dispel the persisting fallacy that an individual’s grammar knowledge is indicative of the way they put this knowledge into practice, this study seeks to highlight the inconsistency which resides between one’s competence and performance in the domain of conjunctions. It aims to shed light on the discrepancy which lies between the knowledge and production of conjunctions. The res...

متن کامل

A presentation of Operational Methodology

The author introduces Operational Methodology (O.M.), a human mind studying method which is radically new in regard to traditional methods, i.e. those of neural biology, cognitive psychology, linguistics, artificial intelligence and philosophy. According to the author O.M. is a decisive step in understanding the human mind and through it we could succeed in the extremely difficult task of the a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • J. UCS

دوره 14  شماره 

صفحات  -

تاریخ انتشار 2008